Irreversibility resulting from contact with a heat bath caused by 

the finiteness of the system 



Katsuhiko Sato and Ken Sekimoto 
Yukawa Institute for Theoretical Physics, Kyoto University, Kyoto 606-8502, Japan 

Tsuyoshi Hondou 
Department of Physics, Tohoku University, Sendai 980-8578, Japan 

Fumiko Takagi 

Satellite Venture Business Laboratory, Ibaraki University, Ibaraki 316-8511, Japan 

(27/07/00) 

Abstract 

When a small dynamical system that is initially in contact with a heat bath 
is detached from this heat bath and then caused to undergo a quasi-static 
adiabatic processes, the resulting statistical distribution of the system's energy 
differs from that of an equilibrium ensemble. Subsequent contact of the system 
with another heat bath is inevitably irreversible, hence the entire process 
cannot be reversed without a net energy transfer to the heat baths. 
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I. INTRODUCTION 



Ordinary thermodynamics assumes the extensivity of the system in question, and it is 
not applicable directly to finite systems. Hill |l[ developed a framework to deal with systems 
which are moderately large and homogeneous, except for their boundaries. In this frame- 
work, the corrections to the thermodynamic behaviour due to the the effect of the surfaces 
and the edges of the system are incorporated in the form of an expansion in the number of 
the constituent atoms, N. Our interest here is in systems further removed from the ther- 
modynamic limit, such as mesoscopic devices and molecular motors, which are intrinsically 
small and heterogeneous and for which the method of Ref. is not sufficient. Hereafter, we 
call such systems 'small systems'. 

In this paper our purpose is to elucidate the distinctive nature of small systems by 
considering the following process, which we denote by {T\, a\\ T 2 , 02} (See Fig. [l]). 
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FIG. 1. The process {Ti, a\\ T 2 , a 2 } is schematically depicted as (i)=>(ii)=>(iii)=>(iv). The gray 
boxes represent the heat baths at the temperatures indicated therein, and the circles represent the 
small system. The thick solid lines in (i) and (iv) denote the thermal contact between the small 
system and the two heat baths. 

(i) First, a small system is in thermal contact with a heat bath of temperature 7\. (Through- 

out this paper we assume that both the interaction energy associated with the thermal 
contact and the work required to change this contact are negligibly small 0].) 

(ii) We then gradually remove the thermal contact between the system and the heat bath. 

(iii) Next, we change some arbitrary control parameter of the system, a, from its initial 
value ai to a new value a 2 quasi- statically. We measure the work required to make this 
change as the increase of the energy of the small system. 

(iv) Finally, we gradually establish a thermal contact between the system and 
the second heat bath of temperature T 2 . 

We now introduce the concept of the 'reversibility' associated with the process {Ti, ai; T 2 , a 2 }. 
Definition: The process {T\, a\; T 2 , a 2 } is called "reversible" if no net energy is 
transfered, on the statistical average over infinite number of repetitions, from or 
to either heat bath through the composite processes of {Ti, ai; T 2 , a 2 } followed by 
{T 2 ,a 2 ;T\,ai}. If the process {T\, a%] T 2 , a 2 } is not reversible, it is called "irre- 
versible." 

Reversibility, therefore, implies that the statistical average of the work needed for the process 
{Ti, ai,T 2 ,a 2 } is the opposite of that for the process {T 2 , a 2 ; Ti, a\}. In macroscopic systems, 



reversibility holds if and only if T 2 is equal to the temperature of the (macroscopic) system 
after operation (iii). This fact is a prerequisite for the existence of thermodynamics, in which 
the Helmholtz free energy can be used to relate equilibrium states at different temperatures. 
For small systems, however, the situation is completely different: 

Statement : The processes {Ti, a%\ T2, 02} for small systems are irreversible, except 

for some "special" cases. 

It is important to note that for small systems we cannot define the temperature un- 
ambiguously, at least when they are isolated, and the energy of the system at the end of 
operation (ii) is a strictly statistical quantity. (This is related to the fact that the operation 
of removing the thermal contact is intrinsically irreversible, however small the work asso- 
ciated with this operation.) In order to understand intuitively how these features of small 
systems lead to irreversibility, we first describe qualitatively what happens in the processes 
(i)-(iv)- 

In (i), the energy E of the small system fluctuates, and its statistics obey the canonical 
ensemble at temperature T\. In (ii), the energy of the system is fixed at a particular value. 
This energy E is a stochastic variable, and its distribution is given by the canonical ensemble 
at temperature Ti, as long as the removal of the thermal contact with the heat bath is 
sufficiently gentle [[|. In (iii), the energy of the small system changes in such a manner that 
the phase volume enclosed by a constant energy surface, J(E, a), ( see (|22j) in the text ) is 
invariant. This follows from the ergodic invariant theorem ||. With the exception of those 
systems for which J(E, a) has a special functional property, the statistical distribution of E 
at the end of this adiabatic process is no longer consistent with the canonical ensemble at 
any temperature. In (iv), this non-canonical distribution of the energy relaxes irreversibly 
(in the ordinary sense) to the canonical distribution at the temperature T2, whether or not, 
on statistical average, the net energy transfer between the system and the heat bath is zero. 
We note that the essential feature distinguishing small systems from macroscopic systems 
is the distortion of the energy distribution in (iii), which can be neglected in macroscopic 
thermodynamics. 

In the following sections we prove the above statement with an argument based on the 
ergodicity hypothesis of Hamiltonian dynamical systems. The outline of the proof is as 
follows. In § m we prove the three lemmas as preparatory steps for the main statement. In 



III we prove the main statement. In § IV we discuss the physical meaning of the Statement. 



We also show there the necessary condition for the process {T 1; tn; T 2 , a 2 } to be reversible. 



II. THREE LEMMAS 

In order to prove the Statement, we first introduce three lemmas. 

Lemma 1 — The entropy S (see below) remains invariant in the process (iii). 

Proof: We consider an ensemble of the mechanical system which is described by a time 
dependent Hamiltonian H. Let us denote by P(T, t) the normalized distribution function 
of the ensemble at time t, where T is the phase coordinates of the system, i.e., the position 
coordinates and the momenta of the system. The entropy S is defined as, a functional of 
the normalized distribution P, 



S[P(.,t)} = - I P(T,t)logP(T,t)dr, 



l{T} 

where the symbol {T} indicates that the integral is taken over the whole phase space. 

Let us now examine the behaviour of the entropy S with time. First, we note that the 
time evolution of the distribution function is described by the so-called Liouville's equation: 

dP(T,t) ( d dH(T,t) d dH{Y,ty 



dt \dq dp dp dq 



P(T,t) 



where q and p are the position coordinates and the momenta of the system, respectively, 
and we have introduced here the velocity of the system point in the phase space, V(r,t)- The 
velocity V(r,t) satisfies 



d_ 
dt 



^(r,t) = (2) 



which can be checked by the equation of motion of the mechanical system. 
Using ( ), we evaluate the time derivative of the entropy S: 

dS[P(-,t)] f dP(T,t) d(xhgx) 



dt J{r} dt dx 

V (r ,t)P(T,t) log P(T,t)dS r 



dY 

x=P(T,t) 



ISr 

where the symbol Sr represents the surface integral over the surface enclosing the phase 
space and we have used (0) and performed the integration by parts. As we are interested in 
a mechanical system such that all particles are confined in a finite region in position space 



and that the Hamiltonian involves the kinetic energy terms p 2 /2m, P(T,t) vanishes at any 
point on S^r}- The lemma applies to the process (iii), since a quasi-static adiabatic process 
is realized by a time-dependent Hamiltonian. 

Lemma 2 — A canonical distribution is the distribution to maximize the entropy S 
subject to the constraint that the ensemble average of the energy is E, i.e., 

/ H(T)P(T)dT = E, (3) 
J{r} 

where the canonical distribution characterized by the Hamiltonian H and the temperature 
T is defined as 

p ^ T ' H ^mwr (4) 

with Z(T, H) being the normalization constant, 

Z(T,H) = [ e'^dT. (5) 

J{T} 

Proof: Let us examine the difference between the entropies of two distributions, the canonical 
distribution P c and any other distribution P, both being normalized and satisfying the 
constraint condition (H). We find 



S[P c ]-S[P]=J {r} P(T)log^- ) dr, (6) 
where we have used (Bf) and the conditions, 



/ P c (T)dT = I P(T)dT = 1, 

J{T} J{T} 

J H(T)P c (T)dT = J H(T)P(T)dT = E. 



I{V} J{V} 

We have written here the canonical distribution as -P C (T) for the simplicity of notation, 
though precisely it implies P C (T;T, H) in our notation. The right-hand side of (H) is known 
as the relative entropy and has been known to be non-negative, as we easily demonstrate as 
follows: 

L p < r > 1oe 7^ r =k m loe ^ - L [P[T) ~ nn dr 

> 0- (7) 



The inequality in the last line follows from the fact that xlogx — x + 1 > for x > 0. 
The equality holds if and only if x = 1, so that only the canonical distribution realizes the 
maximum value of S. Thus the lemma is proved. 

Lemma 3 — Let (H)(t,h) be the ensemble average of the Hamiltonian H over the canon- 
ical distribution P C (T;T, H). (Hereafter we shall denote, in general, the canonical average 
using the distribution P C (T;T, H) by (-)(t,h), that is, for an arbitrary physical quantity A 
defined on the phase space: (A)(t,h) = I{r} ^(H-PctT; T, H)dT. ) Then (H)(t,h) is monoton- 
ically increasing with T. The entropy S[P C (-;T, H)] is also monotonically increasing with 
T. 

Proof: Differentiating (H)(t,h) with respect to T, we obtain 

d(H)(T,H) _ {{H - (H)( TtH) ) 2 ) iT<H ) 

8T T 2 { ' 

Since the value of H is indeed distributed under the canonical distribution, the right-hand 
side of flS|) is positive. 

Likewise, differentiating S[P C (-;T, H)} with respect to T, we obtain 

dS[P c (-;T,H)} = 1 d(H) iTtH) 

dT T dT ' 1 ' 

We note that the temperature T is positive in most physical situations. Indeed, for the 
Hamiltonian involving the kinetic terms, T must be positive to satisfy the normalization 
condition of the canonical distribution. Thus, the right-hand side of @ is positive. 



III. PROOF OF THE STATEMENT 



Let us consider an ensemble of the small systems whose Hamiltonian is H a , where a is a 
parameter controlled from the outside. We shall analyse the two processes for the ensemble, 
{Ti, ai, T 2 , a 2 } and its inverse {T 2 , a 2 ; Ti, ai} with given the values of the temperature Ti 
and parameters a\ and a 2 . A temperature T 2 is to be determined so that the heat bath of 
the temperature T 2 receives no energy from the ensemble of the small systems during the 
process (iv) of {Ti, a\\ T 2 , a 2 }. First, we consider the process {Ti, a±; T 2 , a 2 }. When detached 
form the heat bath of the temperature Ti (the process (ii)), the ensemble is the canonical 
ensemble characterized by Ti and H ai . The ensemble average of the energy, E 1 , is then given 
by 

Ex = (H ai ) {Tl , Hai) . (10) 

When the parameter a of the system is quasi-statically changed along the process (iii), the 
distribution of the systems, in general, changes. The final distribution is uniquely determined 
by the adiabatic theorem. (We will not write down the explicit form of the distribution, 
since our proof does not depend on the concrete form of the distribution.) We will write 
the distribution of the ensemble at a as P a (T; Ti, H ai ), where Ti and H ai are the arguments 
reminding us of the fact that the ensemble at a — a\ was the canonical ensemble with Ti 
and H ai . By our definition, P a at any temperature T and for any value of a satisfies 

P a (T;T,H a ) = P c (T;T,H a ). (11) 

According to Lemma 1, the entropy S remains invariant during the process (iii): 

S[P ai (-;T u H ai )] = S[P a2 (-,T u H ai )]. (12) 

At the end of (iii) the ensemble average of the energy, E 2 , is expressed as 

E 2 = [ H a2 (T)P a2 (T; Ti, H ai )dY. (13) 

J{V} 

For the process (iv), we choose the temperature T 2 so that the average energy of the ensemble 
does not change upon the contact with the heat bath of the temperature T 2 . It is because 
our aim is to know whether or not the process {Ti, ai; T 2 , a 2 } can be made reversible. T 2 
must, therefore, satisfy 



E 2 = (H a2 ) {T2 ,H a2 ). (14) 

According to Lemma 2, the relations ( |T3D and (JTJj) imply 

SlP^T^HaJ] < S[P c (-;T 2 ,H a2 )}, (15) 

where the equality holds only if the ensemble at the end of (iii) is the canonical ensem- 
ble. If our system is such that the canonical distribution is transformed into the canonical 
one through the process (iii) of {Ti, a\\ T 2 , a 2 }, then it is also true for the process (iii) of 
{T 2 ,a 2 ]Ti,ai}, since (iii) is a quasi-static adiabatic and is, therefore, reversible process. 

Next, we examine the process {T 2 ,a2]Ti,a 1 }. As above, the process (iii) yields the 
relation 

S[P a2 (-; T 2 , H^)] = S[P ai (•; T 2 , H a2 )}. (16) 
The ensemble average of the energy at the end of (iii), E[, is 

E[= f H ai (T)P ai (T;T 2 ,H a2 )dT. (17) 

J{T} 

Now we ask if there is a non-zero flow of energy into the heat bath of the temperature T\ 
at the end of (iii) of the process {T 2 , a 2 ; T\, a{\ when we put the ensemble in contact with 
that heat bath. To answer to this we only need to compare the value of E[ with that of E\ 
since the contact with the heat bath forces the ensemble to obey the canonical distribution 
with the average energy E\. If E[ > Ei, then the positive energy, E[ — E lt flows from the 
ensemble to the heat bath. 

To see if this is the case, it is convenient to introduce the temperature T{ which satisfies 

E i = (Hai)(Tl,H ai ), (18) 

that is, we temporally introduce the canonical ensemble whose the ensemble energy is equal 
to E'. The equations ( |TTD and ( |T8| ) imply, with Lemma 2, that 

S[P ai (-;T 2 ,H a2 )} < S[P c (-X,H ai )). (19) 

Combining (|12D, (p~5|) , (p!6|) and (0), we arrive at the inequality 

S[P c (-;T 1 ,H ai )) < S[P c (-X,H ai )), (20) 



where we have used the property (11) of P a . According to Lemma 3, this inequality (|2 
implies 

Ti < T[ 

and 

Bk < E[. (21) 

Thus we now complete the proof of the Statement: Given the temperature T\ and the param- 
eters a\ and a 2 , no matter what we choose as the temperature T 2 , the process {Ti, a\ \ T 2 , a 2 } 
or {T 2 , a 2 ; Ti, ai} generally requires some non- negative energy to move from the ensemble of 
the small systems to the heat baths. The special case with no energy transfer is the reversible 
case as mentioned below fll5|) , that is, the only case that the canonical distribution form of 
the ensemble is preserved along in the quasi-static adiabatic process (iii). We will discuss 
the condition for this to occur in the next section. 



IV. DISCUSSION 



We first note that the inequality ([H]) is fundamental in the sense that if it were violated, 
we could construct a perpetual machine of the second kind with the following hypothetical 
protocol: 

1. We start from an ensemble of the small systems in contact with a heat bath at 
temperature 7\. 

2. We detach these systems gently from the heat bath, and the change the parameter 
a from a% to a 2 quasi-statically. The work necessary to make this change is E 2 — E\ par 
system. 

3. We now fix the parameter a at a 2 , and introduce the interactions among these system. 
We assume that these interactions are sufficiently smaller than the systemfs energy, but at 
the same time large enough for the repartition of the energy within a certain time. 

4. We remove these interactions : the ensemble of the systems obeys the canonical 
distribution characterized by T 2 and H a2 . (Note that we have not used any heat bath other 
than the initial one at the temperature T\.) 

5. We then slowly change the parameter a from a 2 back to a\. The required work here 
is E[ - E 2 . 

6. Finally, we close the cycle by bringing these small systems into contact with the heat 
bath at temperature Ti. 

If the inequality (E 2 — E%) + (E[ — E 2 ) < were to hold in this cycle, we could obtain 
the positive work E\ — E[ through the cycle, where the only resource of the energy is the 
heat bath at temperature Ti. 



Below we will derive briefly the condition that the cycle of processes discussed above 
become reversible. This condition requires that the distribution remains to be the canonical 
one upon quasi-static adiabatic processes, see the paragraph below (|TJ). The change of the 
distribution in those processes is governed by the adiabatic theorem ||: If we denote by Ei 
and E 2 the energy of the system before and after a quasi-static adiabatic process, through 
which the parameter changes from a\ to a 2 , respectively, the "action" J(E, a) defined by 

J(E,a)= f 6(E-H a (T))dT (22) 

J{T} 

satisfies the following relationship: 



J(E 1 ,a 1 ) = J(E 2 ,a 2 ). (23) 

Using (p3|) we can see how the energy distribution of the system's ensemble changes through 
such process. The energy distribution before the process, P(Ei), is given by construction as 

P(E 1 )dE 1 = 6 Tl W(E u a 1 )dE 1 , 

where Z has been defined below © and W(E,a) is defined by W(E,a) = ^ff 1 - Noting 
that ( p3|) and the above definition of W (E, a) give 



W{E x ,a x )dE x = W{E 2 ,a 2 )dE 2 , 

the energy distribution after the process, P'(E 2 ) is given as 

-Ex 

P\E 2 )dE 2 = 6 Tl W(E 2 ,a 2 )dE 2 . 
H ai ) 

This distribution corresponds to the canonical one at some temperature, say T 2 , if and 
only if 

E\ _ E 2 
T\ = T 2 

is satisfied. Thus, we reach the condition for the reversibility: the adiabatic theorem ( |23| ) 
applied for a quasi-static adiabatic process of the system should yield the relationship 

E 2 = ( p(a 1 ,a 2 )E 1 (24) 

with 4>{a\, a 2 ) being a function of the parameter values before and after the process. 

An example of the systems satisfying ( p4[) is a harmonic oscillator with the Hamiltonian, 



H a — p 2 /2 + aq 2 /2. When the spring constant a is changed quasi-statically, the precess 
{Ti, ai,T 2 ,a 2 } is reversible. By constrast, an example that does not satisfy ( pi"D is given by 
the following Hamiltonian: 

H a =p 2 /2 + e W {—} 



The proof, not shown here, is easy. 

Our proof of (|2l|) is for the systems obeying classical dynamics. After our work, H. Tasaki 
has shown that essentially the same mechanism of irreversibility is found for the systems 
obeying quantum mechanics Jl. There, the proof has been done, just we did here, using the 
fact that the canonical ensemble realizes the maximum entropy among those ensembles with 



the same average energy. We could say that it is this property of the canonical ensemble 
that leads to the inequality fl2"T|). 

In order to obtain a deeper physical insight of the inequality fl2lD , let us comparer the 
system which consists of infinitely many small subsystems connected among each other with 
the system of the ensemble of mutually isolated small systems. We shall call the these two 
systems the "connected system" and the "disconnected system" , respectively. As the former 
system is macroscopic, we can apply to it the ordinary thermodynamics and therefore the 
process {T\, a\ \ T 2 , CI2} can be made reversible for such system. To assure it we must assume 
that the interaction energy assigned to the coupling among the small subsystems is assumed 
to be ignorably small while it is effective enough to attain the thermal equilibrium of the 
whole connected system. Under this assumption we can prove (not shown) that the energy 
distribution of the small subsystems belonging to the connected system remains to be the 
canonical one throughout the process (iii). 

Furthermore the entropy related to this distribution, whose definition has been given in 
( 0), is conserved during the process (iii), as we can show easily by using the fact that the 
distribution is kept to be canonical throughout this process. That is, along the process (iii) 
the canonical distribution P C (T; T, H a ) of the connected system at the parameter value a 
satisfies the following relationship: 

S[P c (-,T 1 ,H ai )]=S[P c (-,f,H a )]. 

This equality combined with Lemma 2 implies that, at any point along the process (iii), 
the average energy of the small systems in the disconnected system is generally not smaller 
than the average energy of the small subsystems in the connected system (see Fig.[| for the 
schematic illustration). This figure gives us the intuitive picture that the irreversibility of 
the disconnected system is caused by its excess energy in reference to the connective system 
which is reversible. 



a 
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FIG. 2. Thick solid curves: The average energy of the small system, E, as a function 
of the parameter a, along quasi-static adiabatic processes. The arrows indicate the direction of 
the processes. Dotted curves: The energy of the combined system per constituent small system 
along quasi-static adiabatic processes. At each extreme point of the curves, the value of the average 
energy is indicated by the corresponding temperature of the canonical ensemble. For example, T[ 
indicates that E = {H ai ) IT i H v At the point indicated by T%, the upper solid curve and the upper 
dotted curve are tangent, and at the point indicated by T\, the lower solid curve and the lower 
dotted curve are tangent. 

It is a future topic of investigation to determine if we can construct a thermodynamic 
framework of small systems that can describe adiabatic processes as well as isothermal 
processes for systems in contact with heat baths. Our results imply that, in such framework, 
if there exists a thermodynamic function whose difference calculated with respect to two 
states is the quasi-static adiabatic work E 2 — Ei , then it cannot be the case that this function 
depends on only T and a. (This is in contrast to the case of isothermal processes for a small 
system in contact with a heat bath. For such processes, using the formalism of stochastic 
energetics |5|d7|] it has been shown that the Helmholtz free energy can be used to determine 
the work necessary to move between two states by changing the value of a sufficiently slowly 
that the small system evolves quasi-statically.) To construct the thermodynamic framework 
of a small system, it is desirable to find a method of characterizing in terms of work the 
process through which the distribution changes from a non-canonical form P a2 (-', T\, H ai ) to 



the canonical form P c (-; T 2 , H a2 ). If this is possible, it is natural to expect that the maximum 
of such extracted work to be T 2 (S[P C (-;T 2 , H a2 )} — S[P a2 (-; T 1; H ai )]) (see ( [T5| )). In any 
case, the quantity S[P C (-; T 2 , H a2 )] — S[P a2 (-; Ti, H ai )} is a strong measure of the distance 
from the corresponding reversible process since this is non-vanishing unless the functions 
P a2 (-;Ti,F ai ) and P C (-;T 2 , H a2 ) are identical. 
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